Search CORE

310 research outputs found

Family- and population-based designs identify different rare causal variants

Author: Brad G Kurowski
C Goldberger
EE Eichler
ET Cirulli
Hua He
J Ott
KA Frazer
LA Almasy
Lili Ding
Lisa J Martin
NJ Risch
NJ Schork
P Hintsanen
Q Yang
S Guhathakurta
S Knight
S Lopez-Leon
Tesfaye M Baye
TI Pollin
Xue Zhang
Y Cui
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Both family- and population-based samples are used to identify genetic variants associated with phenotypes. Each strategy has demonstrated advantages, but their ability to identify rare variants and genes containing rare variants is unclear. To compare these two study designs in the identification of rare causal variants, we applied various methods to the population- and family-based data simulated by the Genetic Analysis Workshop 17 with knowledge of the simulated model. Our results suggest that different variants can be identified by different study designs. Family-based and population-based study designs can be complementary in the identification of rare causal variants and should be considered in future studies

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

SNPs in cancer research and treatment

Author: AJ Black
CB Foster
CS Carlson
D Botstein
DO Stram
DW Hein
EH Choi
ES Lander
EY Krynetski
H C Erichsen
HK Kroemer
HM Colhoun
I Cascorbi
J Duan
J Yin
K Golka
K Lindblad-Toh
KE Lohmueller
L Frisse
L Kruglyak
L Kruglyak
L Le Marchand
MJ Daly
N Risch
N Risch
NJ Risch
P Garred
PC Sabeti
PE Bonnen
PL Paris
S J Chanock
S Wacholder
S Wacholder
SB Gabriel
SJ London
TR Rebbeck
W Kalow
Publication venue: Nature Publishing Group
Publication date
Field of study

Crossref

PubMed Central

A Robust Statistical Method for Association-Based eQTL Analysis

Author: AL Dixon
AL Price
B Devlin
C Ouyang
Christine Hackett
CJ Hoggart
David Marshall
DH Alexander
DJ Balding
DJ Schaid
DJ Schaid
DL Remington
EE Schadt
ES Lander
GA Satten
GW Snedecor
HC Fung
HM Kang
I Mackay
J Cockram
J Couzin
J Peng
J Simón-Sánchez
J Yu
JK Pritchard
JK Pritchard
KG Ardlie
KM Weiss
Lin Wang
Lindsey Leach
LR Cardon
LR Cardon
M Morley
M Slatkin
MH Wang
MI McCarthy
Minghui Wang
MM Iles
Momiao Xiong
N Hubner
N Patterson
Ning Jiang
NJ Risch
NJ Risch
NL Johnson
PH Westfall
R Chakraborty
R McGinnis
RS Spielman
RS Spielman
S Campino
Tianye Jia
VG Cheung
VG Cheung
W Astle
W Satake
WJ Ewens
X Zhu
YT Wang
Z Luo
ZB Zeng
Zewei Luo
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Background: It has been well established that theoretical kernel for recently surging genome-wide association study (GWAS) is statistical inference of linkage disequilibrium (LD) between a tested genetic marker and a putative locus affecting a disease trait. However, LD analysis is vulnerable to several confounding factors of which population stratification is the most prominent. Whilst many methods have been proposed to correct for the influence either through predicting the structure parameters or correcting inflation in the test statistic due to the stratification, these may not be feasible or may impose further statistical problems in practical implementation. Methodology: We propose here a novel statistical method to control spurious LD in GWAS from population structure by incorporating a control marker into testing for significance of genetic association of a polymorphic marker with phenotypic variation of a complex trait. The method avoids the need of structure prediction which may be infeasible or inadequate in practice and accounts properly for a varying effect of population stratification on different regions of the genome under study. Utility and statistical properties of the new method were tested through an intensive computer simulation study and an association-based genome-wide mapping of expression quantitative trait loci in genetically divergent human populations. Results/Conclusions: The analyses show that the new method confers an improved statistical power for detecting genuin

CiteSeerX

Crossref

University of Birmingham Research Portal

Directory of Open Access Journals

PubMed Central

Shrunken methodology to genome-wide SNPs selection and construction of SNPs networks

Author: A Schlicker
AC Syvanen
AJ Brookes
BEG Rothberg
BS Srinivasan
D Devos
E Bair
EK Khlestkina
H Liao
H Liao
H Schwender
H Schwender
HC Erichsen
J Park
JC Latourelle
JN Hirschhorn
JY Dai
K Ozaki
M Ashburner
M Cargill
Michael Ng
N Risch
NJ Schork
P Shannon
R Sachidanandam
R Tibshirani
RA Gibbs
RC Gentleman
S Purcell
X Gao
Yang Liu
Z Wang
Z Xu
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

The NEI/NCBI dbGAP database: Genotypes and haplotypes that may specifically predispose to risk of neovascular age-related macular degeneration

Abstract Background To examine if the significantly associated SNPs derived from the genome wide allelic association study on the AREDS cohort at the NEI (dbGAP) specifically confer risk for neovascular age-related macular degeneration (AMD). We ascertained 134 unrelated patients with AMD who had one sibling with an AREDS classification 1 or less and was past the age at which the affected sibling was diagnosed (268 subjects). Genotyping was performed by both direct sequencing and Sequenom iPLEX system technology. Single SNP analyses were conducted with McNemar's Test (both 2 × 2 and 3 × 3 tests) and likelihood ratio tests (LRT). Conditional logistic regression was used to determine significant gene-gene interactions. LRT was used to determine the best fit for each genotypic model tested (additive, dominant or recessive). Results Before release of individual data, <it>p</it>-value information was obtained directly from the AREDS dbGAP website. Of the 35 variants with <it>P </it>< 10-6 examined, 23 significantly modified risk of neovascular AMD. Many variants located in tandem on 1q32-q22 including those in <it>CFH</it>, <it>CFHR4</it>, <it>CFHR2</it>, <it>CFHR5</it>, <it>F13B</it>, <it>ASPM </it>and <it>ZBTB </it>were significantly associated with AMD risk. Of these variants, single SNP analysis revealed that <it>CFH </it>rs572515 was the most significantly associated with AMD risk (P < 10-6). Haplotype analysis supported our findings of single SNP association, demonstrating that the most significant haplotype, GATAGTTCTC, spanning <it>CFH</it>, <it>CFHR4</it>, and <it>CFHR2 </it>was associated with the greatest risk of developing neovascular AMD (<it>P </it>< 10-6). Other than variants on 1q32-q22, only two SNPs, rs9288410 (<it>MAP2</it>) on 2q34-q35 and rs2014307 (<it>PLEKHA1</it>/<it>HTRA1</it>) on 10q26 were significantly associated with AMD status (<it>P </it>= .03 and <it>P </it>< 10-6 respectively). After controlling for smoking history, gender and age, the most significant gene-gene interaction appears to be between rs10801575 (<it>CFH</it>) and rs2014307 (<it>PLEKHA1</it>/<it>HTRA1</it>) (<it>P </it>< 10-11). The best genotypic fit for rs10801575 and rs2014307 was an additive model based on LRT. After applying a Bonferonni correction, no other significant interactions were identified between any other SNPs. Conclusion This is the first replication study on the NEI dbGAP SNPs, demonstrating that alleles on 1q, 2q and 10q may predispose an individual to AMD.</p

Crossref

Harvard University - DASH

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Replicating genotype-phenotype associations

Author: A Goris
A Helgason
A Herbert
AD Skol
AG Clark
AL Price
AO Edwards
B Funke
BA Van Den
BM Neale
C Dina
CA Haiman
CJ Groves
D Altshuler
D Rosskopf
DA Hinds
DH Hall
DM Maraganore
DR Myers
E Zeggini
G Kirov
GR Chandak
GS Hageman
HM Colhoun
J Clarimon
J Flint
J Gudmundsson
J Rosand
JA Todd
JL Haines
JN Hirschhorn
JP Hugot
JP Ioannidis
JP Ioannidis
JP Ioannidis
K Roeder
KE Lohmueller
LJ Scott
LT Amundadottir
M Angell
M Economou
M Horikoshi
M Mutsuddi
M Patterson
M Yeager
MJ Farrer
ML Freedman
N Freimer
N Risch
N Rothman
NB Freimer
NJ Bray
NJ Risch
R Saxena
R Saxena
R Sladek
RE Straub
RH Duerr
RJ Klein
RJ Loos
RS Spielman
S Gretarsdottir
S Wacholder
S Wacholder
S Zollner
SF Field
SF Grant
TA Manolio
Y Li
Y Ogura
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007
Field of study

Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/62757/1/447655a.pd

Crossref

Oxford University Research Archive

The University of Manchester - Institutional Repository

Deep Blue Documents at the University of Michigan

Three Ways of Combining Genotyping and Resequencing in Case-Control Association Studies

Author: B Li
B Li
BE Madsen
CH Buzin
D Altshuler
E Zeggini
Garrett P. Larson
GP Larson
GP Patil
H Meijers-Heijboer
JA Longmate
Jeffrey A. Longmate
JK Pritchard
Justin C. Fay
K Curtin
L Toernqvist
M Li
MI McCarthy
MI McCarthy
NJ Risch
Q Liu
S Zhao
SL Slager
SP Dickson
SS Sommer
Steve S. Sommer
TE Fingerlin
TG Krontiris
Theodore G. Krontiris
V Bansal
W Song
Z Chen
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

We describe three statistical results that we have found to be useful in case-control genetic association testing. All three involve combining the discovery of novel genetic variants, usually by sequencing, with genotyping methods that recognize previously discovered variants. We first consider expanding the list of known variants by concentrating variant-discovery in cases. Although the naive inclusion of cases-only sequencing data would create a bias, we show that some sequencing data may be retained, even if controls are not sequenced. Furthermore, for alleles of intermediate frequency, cases-only sequencing with bias-correction entails little if any loss of power, compared to dividing the same sequencing effort among cases and controls. Secondly, we investigate more strongly focused variant discovery to obtain a greater enrichment for disease-related variants. We show how case status, family history, and marker sharing enrich the discovery set by increments that are multiplicative with penetrance, enabling the preferential discovery of high-penetrance variants. A third result applies when sequencing is the primary means of counting alleles in both cases and controls, but a supplementary pooled genotyping sample is used to identify the variants that are very rare. We show that this raises no validity issues, and we evaluate a less expensive and more adaptive approach to judging rarity, based on group-specific variants. We demonstrate the important and unusual caveat that this method requires equal sample sizes for validity. These three results can be used to more efficiently detect the association of rare genetic variants with disease

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Accuracy of Predicting the Genetic Risk of Disease Using a Genome-Wide Approach

Author: A Robertson
A Robertson
ACJW Janssens
AF Mcrae
AJ Chamberlain
Beatriz Villanueva
BJ Hayes
D Habier
D Shriner
DE Reich
DR Cox
DS Falconer
Hans D. Daetwyler
HHH Goring
JC Barrett
JC Dekkers
JC Venter
JK Pritchard
JN Hirschhorn
John A. Woolliams
KG Ardlie
M Lynch
M Sargolzaei
Michael Nicholas Weedon
MN Weedon
N Risch
NJ Yi
NR Wray
P Bijma
PDP Pharoah
SZ Xu
TH Meuwissen
TR Solberg
W Valdar
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

Background - The prediction of the genetic disease risk of an individual is a powerful public health tool. While predicting risk has been successful in diseases which follow simple Mendelian inheritance, it has proven challenging in complex diseases for which a large number of loci contribute to the genetic variance. The large numbers of single nucleotide polymorphisms now available provide new opportunities for predicting genetic risk of complex diseases with high accuracy. Methodology/Principal Findings - We have derived simple deterministic formulae to predict the accuracy of predicted genetic risk from population or case control studies using a genome-wide approach and assuming a dichotomous disease phenotype with an underlying continuous liability. We show that the prediction equations are special cases of the more general problem of predicting the accuracy of estimates of genetic values of a continuous phenotype. Our predictive equations are responsive to all parameters that affect accuracy and they are independent of allele frequency and effect distributions. Deterministic prediction errors when tested by simulation were generally small. The common link among the expressions for accuracy is that they are best summarized as the product of the ratio of number of phenotypic records per number of risk loci and the observed heritability. Conclusions/Significance - This study advances the understanding of the relative power of case control and population studies of disease. The predictions represent an upper bound of accuracy which may be achievable with improved effect estimation methods. The formulae derived will help researchers determine an appropriate sample size to attain a certain accuracy when predicting genetic ris

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Edinburgh Research Explorer

Wageningen University & Research Publications

SRUC - Scotland's Rural College

AWclust: point-and-click software for non-parametric population structure analysis

Author: A Bowcock
AL Price
B Devlin
B Devlin
B Devlin
B Wu
CJ Hoggart
CJ Hoggart
D Falush
ES Lander
G Guillot
H Tang
J Corander
J Corander
J Marchini
J Mountain
JK Pritchard
Joshua D Starmer
KJ Dawson
L Excoffer
LL Cavalli-Sforza
M Bauchet
M Freedman
M Shriver
N Liu
N Patterson
N Rosenberg
NJ Risch
O Lao
PM McKeigue
R Kaeuffer
R Tibshirani
S Purcell
S Purcell
SL Guthery
X Gao
Xiaoyi Gao
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Population structure analysis is important to genetic association studies and evolutionary investigations. Parametric approaches, e.g. STRUCTURE and L-POP, usually assume Hardy-Weinberg equilibrium (HWE) and linkage equilibrium among loci in sample population individuals. However, the assumptions may not hold and allele frequency estimation may not be accurate in some data sets. The improved version of STRUCTURE (version 2.1) can incorporate linkage information among loci but is still sensitive to high background linkage disequilibrium. Nowadays, large-scale single nucleotide polymorphisms (SNPs) are becoming popular in genetic studies. Therefore, it is imperative to have software that makes full use of these genetic data to generate inference even when model assumptions do not hold or allele frequency estimation suffers from high variation. Results We have developed point-and-click software for non-parametric population structure analysis distributed as an R package. The software takes advantage of the large number of SNPs available to categorize individuals into ethnically similar clusters and it does not require assumptions about population models. Nor does it estimate allele frequencies. Moreover, this software can also infer the optimal number of populations. Conclusion Our software tool employs non-parametric approaches to assign individuals to clusters using SNPs. It provides efficient computation and an intuitive way for researchers to explore ethnic relationships among individuals. It can be complementary to parametric approaches in population structure analysis.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Carolina Digital Repository

A random forest approach to the detection of epistatic interactions in case-control studies

Author: A Bureau
A Collins
AG Heidema
AM Glazier
BA McKinney
CT Tsai
E Lander
HC Fung
J Hoh
J Marchini
J Millstein
J Simon-Sanchez
JH Moore
JK Pritchard
L Breiman
L Kruglyak
L Tiret
MD Ritchie
MP Martin
MR Nelson
N Chatterjee
NJ Risch
R Culverhouse
R Diaz-Uriarte
R Jiang
R Jiang
RJ Klein
RO Duda
Rui Jiang
SM Williams
TM Phuong
Wanwan Tang
Wenhui Fu
X Chen
Xuebing Wu
Y Ye
Y Zhang
YM Cho
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background The key roles of epistatic interactions between multiple genetic variants in the pathogenesis of complex diseases notwithstanding, the detection of such interactions remains a great challenge in genome-wide association studies. Although some existing multi-locus approaches have shown their successes in small-scale case-control data, the "combination explosion" course prohibits their applications to genome-wide analysis. It is therefore indispensable to develop new methods that are able to reduce the search space for epistatic interactions from an astronomic number of all possible combinations of genetic variants to a manageable set of candidates. Results We studied case-control data from the viewpoint of binary classification. More precisely, we treated single nucleotide polymorphism (SNP) markers as categorical features and adopted the random forest to discriminate cases against controls. On the basis of the gini importance given by the random forest, we designed a sliding window sequential forward feature selection (SWSFS) algorithm to select a small set of candidate SNPs that could minimize the classification error and then statistically tested up to three-way interactions of the candidates. We compared this approach with three existing methods on three simulated disease models and showed that our approach is comparable to, sometimes more powerful than, the other methods. We applied our approach to a genome-wide case-control dataset for Age-related Macular Degeneration (AMD) and successfully identified two SNPs that were reported to be associated with this disease. Conclusion Besides existing pure statistical approaches, we demonstrated the feasibility of incorporating machine learning methods into genome-wide case-control studies. The gini importance offers yet another measure for the associations between SNPs and complex diseases, thereby complementing existing statistical measures to facilitate the identification of epistatic interactions and the understanding of epistasis in the pathogenesis of complex diseases.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central